Search for: All records

Creators/Authors contains: "Yang, Lin F."

« Prev Next »

Total Resources

10

Resource Type
Conference Paper

8

Conference Proceeding

0

Dataset

0

Journal Article

2

Workshop Report

0

Availability
Full Text / Resource Available

9

Citation Only

1

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Multi-Arm Bandits over Action Erasure Channels

https://doi.org/10.1109/ISIT54713.2023.10206591

Hanna, Osama A. ; Karakas, Merve ; Yang, Lin F. ; Fragouli, Christina ( June 2023 , IEEE International Symposium on Information Theory)

Free, publicly-accessible full text available June 25, 2024
Compression for Multi-Arm Bandits

https://doi.org/10.1109/JSAIT.2023.3260770

Hanna, Osama A. ; Yang, Lin F. ; Fragouli, Christina ( December 2022 , IEEE Journal on Selected Areas in Information Theory)

Full Text Available
Universal Streaming of Subset Norms

https://doi.org/10.4086/toc.2022.v018a020

Braverman, Vladimir ; Krauthgamer, Robert ; Yang, Lin F. ( January 2022 , Theory of Computing)

Full Text Available
Model-Based Reinforcement Learning with a Generative Model is Minimax Optimal

Agarwal, Alekh ; Kakade, Sham ; Yang, Lin F. ( July 2020 , Proceedings of Machine Learning Research)

This work considers the sample and computational complexity of obtaining an $\epsilon$-optimal policy in a discounted Markov Decision Process (MDP), given only access to a generative model. In this model, the learner accesses the underlying transition model via a sampling oracle that provides a sample of the next state, when given any state-action pair as input. We are interested in a basic and unresolved question in model based planning: is this naïve “plug-in” approach — where we build the maximum likelihood estimate of the transition model in the MDP from observations and then find an optimal policy in this empirical MDP — non-asymptotically, minimax optimal? Our main result answers this question positively. With regards to computation, our result provides a simpler approach towards minimax optimal planning: in comparison to prior model-free results, we show that using \emph{any} high accuracy, black-box planning oracle in the empirical model suffices to obtain the minimax error rate. The key proof technique uses a leave-one-out analysis, in a novel “absorbing MDP” construction, to decouple the statistical dependency issues that arise in the analysis of model-based planning; this construction may be helpful more generally.
more » « less
Full Text Available
Is a Good Representation Sufficient for Sample Efficient Reinforcement Learning?

Du, Simon S. ; Kakade, Sham M. ; Wang, Ruosong ; Yang, Lin F. ( January 2020 , International Conference on Learning Representations)

Modern deep learning methods provide effective means to learn good representations. However, is a good representation itself sufficient for sample efficient reinforcement learning? This question has largely been studied only with respect to (worst-case) approximation error, in the more classical approximate dynamic programming literature. With regards to the statistical viewpoint, this question is largely unexplored, and the extant body of literature mainly focuses on conditions which permit sample efficient reinforcement learning with little understanding of what are necessary conditions for efficient reinforcement learning. This work shows that, from the statistical viewpoint, the situation is far subtler than suggested by the more traditional approximation viewpoint, where the requirements on the representation that suffice for sample efficient RL are even more stringent. Our main results provide sharp thresholds for reinforcement learning methods, showing that there are hard limitations on what constitutes good function approximation (in terms of the dimensionality of the representation), where we focus on natural representational conditions relevant to value-based, model-based, and policy-based learning. These lower bounds highlight that having a good (value-based, model-based, or policy-based) representation in and of itself is insufficient for efficient reinforcement learning, unless the quality of this approximation passes certain hard thresholds. Furthermore, our lower bounds also imply exponential separations on the sample complexity between 1) value-based learning with perfect representation and value-based learning with a good-but-not-perfect representation, 2) value-based learning and policy-based learning, 3) policy-based learning and supervised learning and 4) reinforcement learning and imitation learning.
more » « less
Full Text Available
Planning with General Objective Functions: Going Beyond Total Rewards

Wang, Ruosong ; Zhong, Peilin ; Du, Simon S ; Salakhutdinov, Russ R ; Yang, Lin F. ( January 2020 , Annual Conference on Neural Information Processing Systems)
null (Ed.)
Full Text Available
Efficient Symmetric Norm Regression via Linear Sketching

Song, Zhao ; Wang, Ruosong ; Yang, Lin F ; Zhang, Hongyang ; Zhong, Peilin ( January 2019 , Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019)

Full Text Available
Clustering High Dimensional Dynamic Data Streams

Braverman, Vladimir ; Frahling, Gereon ; Lang, Harry ; Sohler, Christian ; Yang, Lin F. ( January 2018 , Proceedings of Machine Learning Research)

Full Text Available
Approximate Convex Hull of Data Streams

Blum, Avrim ; Braverman, Vladimir ; Kumar, Ananya ; Lang, Harry ; Yang, Lin F. ( January 2018 , The 45th International Colloquium on Automata, Languages, and Programming (ICALP 2018))

Full Text Available
Streaming symmetric norms via measure concentration

https://doi.org/10.1145/3055399.3055424

Błasiok, Jarosław ; Braverman, Vladimir ; Chestnut, Stephen R. ; Krauthgamer, Robert ; Yang, Lin F. ( January 2017 , Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing)

Full Text Available